261M
261M
collection
eye 261M
This is a collection of web page captures from links added to, or changed on, Wikipedia pages. The idea is to bring a reliability to Wikipedia outlinks so that if the pages referenced by Wikipedia articles are changed, or go away, a reader can permanently find what was originally referred to. This is part of the Internet Archive's attempt to rid the web of broken links .
Topics: Wikipedia, Wikimedia
133.8M
134M
collection
eye 133.8M
This is a collection of pages and embedded objects from WordPress blogs and the external pages they link to. Captures of these pages are made on a continuous basis seeded from a feed of new or changed pages hosted by Wordpress.com or by Wordpress pages hosted by sites running a properly configured Jetpack wordpress plugin.
Topics: Wordpress.com, blogs, jetpack
80.4M
80M
collection
eye 80.4M
A daily crawl of more than 200,000 home pages of news sites, including the pages linked from those home pages. Site list provided by The GDELT Project
Topics: GDELT, News
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Wed Oct 30 21:19:56 PDT 2013 to Wed Oct 30 15:58:29 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Jun 27 02:35:05 PDT 2015 to Fri Jun 26 21:13:31 PDT 2015.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 01:48:56 PDT 2013 to Tue Sep 10 19:39:40 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Nov 4 01:08:35 PST 2013 to Sun Nov 3 18:31:38 PST 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Fri Nov 8 18:07:43 PST 2013 to Fri Nov 8 11:24:54 PST 2013.
Topics: no404, wordpress, crawldata
244,948
245K
Jun 5, 2016
06/16
by
Internet Archive
web
eye 244,948
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sun Jun 5 12:10:10 PDT 2016 to Sun Jun 5 06:42:33 PDT 2016.
Topic: crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Mon Oct 7 06:39:20 PDT 2013 to Mon Oct 7 01:07:00 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Oct 9 13:36:49 PDT 2013 to Wed Oct 9 07:59:25 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Oct 11 18:57:27 PDT 2013 to Fri Oct 11 18:27:42 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Nov 8 18:12:47 PST 2013 to Fri Nov 8 11:19:28 PST 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Jan 11 20:40:00 PST 2015 to Sun Jan 11 17:30:17 PST 2015.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Apr 13 23:31:41 PDT 2015 to Mon Apr 13 18:01:32 PDT 2015.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Mon Dec 2 21:56:25 PST 2013 to Mon Dec 2 15:29:07 PST 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Jan 10 20:28:55 PST 2015 to Sat Jan 10 14:35:49 PST 2015.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Thu Mar 13 15:39:54 PDT 2014 to Thu Mar 13 10:29:53 PDT 2014.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Nov 2 18:33:00 PDT 2013 to Sat Nov 2 13:05:27 PDT 2013.
Topics: no404, wikipedia, crawldata
194,822
195K
Jul 16, 2015
07/15
by
Internet Archive
web
eye 194,822
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Jul 16 10:27:47 PDT 2015 to Thu Jul 16 04:43:26 PDT 2015.
Topic: crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 02:25:02 PDT 2013 to Tue Sep 10 20:16:15 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Nov 8 17:40:25 PST 2013 to Fri Nov 8 17:19:48 PST 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 02:06:46 PDT 2013 to Fri Oct 11 20:57:12 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Sep 26 10:30:15 PDT 2014 to Fri Sep 26 16:41:01 PDT 2014.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Nov 8 17:30:01 PST 2013 to Fri Nov 8 10:39:57 PST 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 26 06:02:13 PDT 2014 to Sun Oct 26 00:44:36 PDT 2014.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Tue Oct 28 01:11:50 PDT 2014 to Mon Oct 27 19:49:07 PDT 2014.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 05:10:05 PDT 2013 to Fri Oct 11 23:33:01 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Sep 22 23:03:50 PDT 2013 to Sun Sep 22 17:38:17 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Sep 27 21:12:16 PDT 2013 to Fri Sep 27 15:37:52 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 01:17:32 PDT 2013 to Fri Oct 11 19:35:18 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 25 13:38:01 PDT 2014 to Sat Oct 25 08:30:18 PDT 2014.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Tue Dec 30 23:11:37 PST 2014 to Tue Dec 30 17:11:10 PST 2014.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Mon Dec 9 02:29:07 PST 2013 to Sun Dec 8 19:51:18 PST 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 09:26:30 PDT 2013 to Wed Sep 11 03:51:47 PDT 2013.
Topics: no404, wordpress, crawldata
160,018
160K
Jul 30, 2015
07/15
by
Internet Archive
web
eye 160,018
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Jul 30 02:42:20 PDT 2015 to Wed Jul 29 20:39:49 PDT 2015.
Topic: crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 03:09:48 PDT 2013 to Fri Oct 11 21:36:24 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 13 22:05:29 PDT 2013 to Sun Oct 13 16:25:33 PDT 2013.
Topics: no404, wikipedia, crawldata
157,943
158K
Jul 30, 2015
07/15
by
Internet Archive
web
eye 157,943
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Jul 30 00:44:10 PDT 2015 to Wed Jul 29 18:56:39 PDT 2015.
Topic: crawldata
157,232
157K
Feb 3, 2016
02/16
by
Internet Archive
web
eye 157,232
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl892.us.archive.org:gdelt from Wed Feb 3 05:18:05 PST 2016 to Wed Feb 3 01:37:40 PST 2016.
Topic: crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 25 12:43:51 PDT 2014 to Sat Oct 25 07:04:01 PDT 2014.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Sep 22 02:43:39 PDT 2013 to Sat Sep 21 21:49:05 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 04:03:47 PDT 2013 to Fri Oct 11 22:24:49 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 22:25:59 PDT 2013 to Sat Sep 21 18:13:45 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 23:53:08 PDT 2013 to Sat Sep 21 19:42:29 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Feb 15 10:36:00 PST 2015 to Sun Feb 15 05:04:15 PST 2015.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Thu Oct 10 04:04:33 PDT 2013 to Wed Oct 9 22:07:10 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Nov 8 19:29:44 PST 2013 to Fri Nov 8 12:30:08 PST 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 13:26:15 PDT 2013 to Sat Oct 12 07:59:33 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Mon Sep 9 20:54:31 PDT 2013 to Mon Sep 9 22:15:49 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 02:57:31 PDT 2013 to Tue Sep 10 20:46:44 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl810.us.archive.org:wideaux from Mon Apr 4 10:54:40 PDT 2016 to Tue Apr 5 04:55:27 PDT 2016.
Topics: no404, search, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 12:19:50 PDT 2013 to Sat Sep 21 07:00:20 PDT 2013.
Topics: no404, wikipedia, crawldata
135,442
135K
Oct 2, 2015
10/15
by
Internet Archive
web
eye 135,442
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Oct 2 13:26:12 PDT 2015 to Fri Oct 2 07:16:23 PDT 2015.
Topic: crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 26 05:06:16 PDT 2014 to Sat Oct 25 23:40:30 PDT 2014.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 07:38:37 PDT 2013 to Sat Oct 12 02:15:16 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Tue Mar 22 04:54:50 PDT 2016 to Mon Mar 21 23:42:16 PDT 2016.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Feb 2 10:16:09 PST 2014 to Sun Feb 2 04:00:31 PST 2014.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 11:08:48 PDT 2013 to Sat Oct 12 06:01:41 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 01:00:19 PDT 2013 to Tue Sep 10 19:09:02 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Dec 15 11:40:44 PST 2013 to Sun Dec 15 05:32:44 PST 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Mon Nov 4 03:20:28 PST 2013 to Sun Nov 3 20:09:08 PST 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Thu Sep 26 18:48:25 PDT 2013 to Thu Sep 26 13:29:27 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Dec 8 22:11:01 PST 2013 to Sun Dec 8 16:34:40 PST 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Tue Oct 8 00:49:08 PDT 2013 to Mon Oct 7 18:50:02 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 06:01:08 PDT 2013 to Sat Oct 12 00:24:12 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 10:13:29 PDT 2013 to Sat Oct 12 04:53:58 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 12:22:18 PDT 2013 to Sat Oct 12 06:45:42 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 13 15:39:00 PDT 2013 to Sun Oct 13 10:13:45 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Fri Nov 8 17:19:15 PST 2013 to Fri Nov 8 10:31:17 PST 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Oct 18 06:02:22 PDT 2013 to Fri Oct 18 00:35:25 PDT 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 13 23:06:38 PDT 2013 to Sun Oct 13 17:41:00 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Dec 20 12:03:20 PST 2014 to Sat Dec 20 05:24:49 PST 2014.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 13 07:51:48 PDT 2013 to Sun Oct 13 02:21:02 PDT 2013.
Topics: no404, wikipedia, crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Sep 22 21:59:29 PDT 2013 to Sun Sep 22 16:25:57 PDT 2013.
Topics: no404, wikipedia, crawldata